3 research outputs found
Efficient Parallel Random Sampling: Vectorized, Cache-Efficient, and Online
We consider the problem of sampling n numbers from the range 1..N
without replacement on modern architectures. The main result
is a simple divide-and-conquer scheme that makes sequential algorithms more
cache efficient and leads to a parallel algorithm running in expected time
O(n/p + log p) on p processors, i.e., it scales to massively parallel
machines even for moderate values of n. The amount of communication between
the processors is very small (at most O(log p)) and independent of
the sample size. We also discuss modifications needed for load balancing,
online sampling, sampling with replacement, Bernoulli sampling, and
vectorization on SIMD units or GPUs.
Thrill: High-Performance Algorithmic Distributed Batch Data Processing with C++
We present the design and a first performance evaluation of Thrill -- a
prototype of a general purpose big data processing framework with a convenient
data-flow style programming interface. Thrill is somewhat similar to Apache
Spark and Apache Flink with at least two main differences. First, Thrill is
based on C++ which enables performance advantages due to direct native code
compilation, a more cache-friendly memory layout, and explicit memory
management. In particular, Thrill uses template meta-programming to compile
chains of subsequent local operations into a single binary routine without
intermediate buffering and with minimal indirections. Second, Thrill uses
arrays rather than multisets as its primary data structure which enables
additional operations like sorting, prefix sums, window scans, or combining
corresponding fields of several arrays (zipping). We compare Thrill with Apache
Spark and Apache Flink using five kernels from the HiBench suite. Thrill is
consistently faster and often several times faster than the other frameworks.
At the same time, the source codes have a similar level of simplicity and
abstraction.